Relations between Entropies Produced in Nondeterministic Thermodynamic Processes 
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Landauer's erasure principle is generalized to nondeterministic processes on systems having an 
arbitrary number of non-symmetrical logical states. The condition that the process is applied in 
the same way, irrespective of the initial logical state, imposes some restrictions on the individual 
heat exchanges associated with each possible transition. The complete set of such restrictions are 
derived by a statistical analysis of the phase-space flow induced by the process. Landauer's erasure 
principle can be derived from and is a special case of these. 
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I. INTRODUCTION 

Landauer's erasure principle (LEP) is a fundamental 
result in the thermodynamics of computation. It gives 
a relation between the amount of the information stored 
by a memory device and the average entropy increase in 
the environment when a process erases that information. 
The principle holds for processes that satisfy two essential 
features: 

(A) The process is carried out in the same way indepen- 
dent of the initial logical state (i.e., the information 
stored) or the microstate of the device and 

(B) it restores the device to a known standard state at 
the end. 

The first feature is necessary for a complete erasure of 
information. Although it is conceivable that the process 
can read the information and take different actions de- 
pending on it, this can be done only by recording the 
information somewhere else. In that case, the process 
must also erase the recorded information. Even when 
this is the case, the statement (A) remains valid when 
the recording instrument is considered as part of the de- 
vice. 

Landauer has shown that under both of these con- 
ditions, heats must be dumped to the environment in 
such a way that the average increase of the entropy is at 
least ks In 2 per bit of the information erasedji where ks 
is Boltzmann's constant. The original derivation of the 
principle by Landauer also assumes that the logical states 
of the device are symmetric, i.e., they have equal values 
for the thermodynamical quantities. However, such a re- 
striction is not necessary when the total entropy change 
of both the device and the environment is considered. 
For example, for devices having non-symmetrical logical 
states, it is seen that the main conclusion of LEP con- 
tinues to be valid when a complete write-erase cycle is 
considered^ 

The principle is of prime importance in Bennett's res- 
olution of Maxwell's demon paradox.— In its quest to 
reduce entropy, the demon periodically makes measure- 
ments, records the results into its memory and takes some 



actions depending on these results. As the memory has 
a finite capacity, it must be cleared at some stage and 
reset to some standard state in order to repeat the same 
cycle of events. It is this erasure process and the as- 
sociated entropy increase that exactly offsets the reduc- 
tion of entropy the demon has achieved and thus saves 
the second law. Independently from Landauer and Ben- 
nett, same conclusions have been obtained by Penrose 
in his textbook on statistical mechanics.^ A review of 
Landauer's principle and Maxwell's demon, as well as 
reprints of some key articles can be found in Ref. 0. 

Since Landauer's original derivation of LEP relies on 
the second law where the definition of entropy is ex- 
tended to an ensemble of memory devices, several ob- 
jections have been raised on its validity. This prompted 
the appearance of alternative proofs of the principle that 
does not rely on the second law, by using the Fokker- 
Planck equation^, by using the microstate distribution 
functions^ and by analyzing the phase-space flow the 
erasing process induces^ 

In all of these, the feature (A) of the process is of 
central importance. Since the device and the environ- 
ment are subjected to the same treatment independent 
of the initial logical state, feature (A) is equivalent to the 
statement that the time-dependence of the microstates 
during the process is governed by a single, logical-state- 
independent Hamiltonian. As a result, the microstates 
before and after the process are related by a single map. 
That map is a canonical transformation for classical sys- 
tems, which preserves the phase-space volumes by Li- 
ouville's theorem, and is an isometry for quantum sys- 
tems, which preserves the dimensions of the subspaces 
it is acting to. LEP follows from the constancy of the 
phase-space measures: as the process necessarily reduces 
the phase space of the device by requirement (B), it must 
expand that of the environment which leads to the Lan- 
dauer bound. 

The purpose of this article is to investigate the full 
implications of the feature (A) in the most general set- 
ting, without any additional assumptions. It contains the 
derivation of a complete set of relations that captures all 
restrictions that can be placed on the heats dumped to 
the environment in such processes. When deriving these 
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relations, no further assumptions are made on the process 
and the logical states of the system. The logical states 
might be non-symmetrical and, in addition, the initial 
and final sets of these states might be different both in 
number and in character. Also, the process applied can 
be nondeterministic, i.e., for the same initial logical state, 
it might lead to a set of different final states with known 
transition probabilities. The precise statement that will 
be proved is the following, 

Theorem 1. Suppose that an arbitrary process having 
feature (A) is applied on a system that has n initial logical 
states and brings it to a final configuration with m logical 
states. If P((3\a) denotes the conditional probability that 
the initial logical state a ends up in the final logical state 
[3, and ksspa denotes the total entropy increase of the 
system and the environment when an a to (3 transition 
occurs, then the following inequalities are satisfied 

n 

]T P(/?|a)e- s ^ < 1 G9=l,2,...,m) , (1) 

a=l 

i.e., m inequalities, one for each possible final state (3. 

No additional assumptions other than feature (A) will 
be made in proving this theorem. If the process obtains 
some information about the system and stores it into 
some recording instrument or, in general, an interaction 
occurs with an instrument, then the instrument should 
be considered as part of the system. Apart from this, if 
the initial logical state of the system is recorded elsewhere 
before the process begins, the conclusions of the theorem 
will not change as long as the process does not use this 
information. 

These relations imply LEP as a special case (when fea- 
ture (B) is also imposed), but they go far beyond LEP 
in implications because they cannot be derived starting 
from this principle. It will also be shown that these re- 
lations are complete, i.e., one cannot find any further 
restrictions between the transition probabilities and in- 
dividual entropy increases. Consequently, any process 
with parameters satisfying ([T]) can be constructed in prin- 
ciple. Therefore, these inequalities provide a complete 
description of the thermodynamics of classical informa- 
tion processing by taking nondeterministic logical opera- 
tions into account. Like LEP, these relations arc mainly 
of theoretical interest, but they may find application in 
the investigation of heat exchange requirements of prob- 
abilistic Turing machines. The heat exchanges for non- 
deterministic processes have previously been analyzed by 
Penrose4. They are also investigated in detail by Maroney 
for the purpose of generalizing LEP.^*I2 However, as it is 
shown in this article, their entropic restrictions on heat 
exchanges are weaker than theorem 1. 

The organization of the article is as follows. In sec- 
tion QT] first, the notation used for describing the sys- 
tems and the processes is defined. Then, concentrating 
on the constant temperature case, the proof of theorem 
1 is given for both classical and quantum systems. Fi- 
nally, the completeness of these relations is proved by 



constructing an arbitrary process on a simple system. In 
section IIII1 the connection between the relations ((T|) and 
previously known relations such as LEP is discussed. Sec- 
tion [IV] contains a brief conclusion. Finally, Appendix [Al 
contains the proof of the theorem for the case of various 
baths at different temperatures. 

II. RELATIONS BETWEEN HEATS DUMPED 
A. Definitions and Notation 

Consider a system S which can have two possibly iden- 
tical configurations which will be called initial and final 
configurations. In the initial configuration, the system 
has n distinct logical states described by a Hamiltonian 
His and is in equilibrium with a heat bath Bi at tem- 
perature Ti. In the final configuration, the system has 
m distinct logical states described by Hamiltonian -ff/s 
and is in equilibrium with a bath Bf at temperature Tf. 
The two baths might be identical. The system can be 
microscopic, but all of the baths must be macroscopic. 

A process brings the system from the initial configu- 
ration to the final configuration, while in the meantime 
it brings the system into contact with various heat baths 
Bj at different temperatures Tj. Any process satisfy- 
ing the feature (A) can be described as a time- dependent 
Hamiltonian 

H(t) =Hs(s;t)+H int (s;bi,b 2 ,...,bf,t) + Y^ h b 3 (bj ) 

j 

(2) 

where s and b., denote the phase-space coordinates of 
the system S and the bath Bj respectively, Hba is the 
Hamiltonian of bath Bj and Hi nt describes the detailed 
coupling of the system to the baths. If the process starts 
at time ti and ends at tf, the total Hamiltonian reduces 
to the corresponding expressions for each configuration. 
In other words, for t < ij, 

H(t) = H iS (s) + H itint (s; bt) + J2 h b, (bj) , (3) 

3 

and for t > t/ 

H{t)=H f s{s)+H fM ttehf) + Y,HB i fri) ■ (4) 

3 

In both of these configurations, it will be assumed that 
the coupling terms -ff c ,int(s, b c ) (c = i,f) are negligi- 
bly small. If Hf^i n t is identically zero, then sufficiently 
long times should pass during the process for an effective 
equilibration. 

The phase space of the system is divided into disjoint 
regions corresponding to each logical state in such a way 
that any microstate s belongs to one and only one logi- 
cal state. This division can be described with indicator 
functions 9 ca which are defined as 9 ca (s) = 1 when s be- 
longs to logical state a in configuration c and 9 ca (s) = 
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otherwise. As a result, X)a^«i( s ) = 1 must be satisfied 
for all s. 

Miscellaneous thermodynamical quantities for each 
logical state must be defined in the canonical ensemble 
by using only those microstates that belong to the given 
logical state. Thus, using the indicator functions, the 
partition function and the internal energy are 

Z ca = J 6 ca (s)e- H ^/ k ^ds , (5) 

U ca = J- / H cS (s)6 ca (s)e- H ^( s y k ^ds , (6) 

respectively and the free energy, F ca = — fcsT c lnZ CQ ,, 
and entropy, S ca = (U ca — F ca )/T c , are defined accord- 
ingly. 

The logical states must be sufficiently stable in such a 
way that they can be used for information storage pur- 
poses, i.e., once the system is in one of the logical states, 
either it does not make a transition into another logi- 
cal state, or the transition time scales are long compared 
to the process and equilibration time scales. In the for- 
mer case, impenetrable barriers separate the microstates 
of different logical states and the above quantities corre- 
spond to the exact canonical thermodynamical functions. 
In the latter case, high thermal or diffusion barriers with 
long transition times separate the logical states and there 
is some arbitrariness in the choice of the indicator func- 
tions. Once this choice is made however, the thermody- 
namical functions must be defined as above. 

In the proof, the canonical map between the mi- 
crostates of the composite system of S and the baths 
for two given times t\ and t% will be investigated (ti < 
ti < tf < t%). It will be supposed that the system is 
prepared such that, at time t\ it is in logical state a 
of the initial configuration and in equilibrium with the 
bath Bi. The initial distribution of the microstates of 
the composite system depends on the preparation, but 
it must be consistent with the equilibrium assumption. 
Consider the microstate of the composite system at time 
t-2 after the process is applied. Let P(f3\a) be the proba- 
bility that the final microstate belongs to logical state f3 
of final configuration. Let W(f3 <— a) denote the average 
work done given that an a to f3 transition occurs. This 
is the conditional average of the total energy change of 
the system and the baths over all microstates that take 
part in a to f3 transition. Similarly let Qj(f3 <— a) denote 
the average amount of heat transferred to the bath Bj 
given that a to /3 transition occurs. As these quantities 
must be computed in equilibrium, ti — tj must be suffi- 
ciently large for getting thermal equilibrium at time t^. 
Moreover, tj — t\ must be sufficiently large for thermal 
equilibration. Provided that both of these conditions are 
satisfied, the conditional averages of the initial and final 
energies of the system over microstates that take part 
in a to (3 transition are Ui a and Ufp. Therefore, the 
quantities above are related by the first law, 

U ff3 -U ia = W(/3^a)-J2Qj(l 3 ^ a ) ■ (7) 

3 



The total entropy change per ks in a to (3 transition will 
be denoted by sp a and is given by 

k B s 0a = S f p-S ia + J2 Qj{f3 T " a) ■ (8) 

3 1 

In order to interpret these quantities as the total entropy 
change, the baths must be sufficiently large so that the 
heats dumped Qj{f3 <— a) cannot change their tempera- 
ture. 

Before going further, it is worth to consider an impor- 
tant special case, the constant-temperature case, where 
the system is in contact with a single bath at temperature 
T (where Bi = Bj and T t = T f = T). In that case, the 
quantities spa can be given a simple meaning in terms 
of the heat emitted to the bath or the work done on the 
system. These quantities can be expressed in terms of 

Sf3a as 

W(p^a) = F ff3 -F la + k B Ts Pa , (9) 
= -T(Sf -S ia ) + k B Tsp a . (10) 

Note that, when some other thermodynamically re- 
versible deterministic process brings the system from a 
to 0, —T(Sfp — Si a ) is the exact amount of heat that 
must be dumped to the bath. For the current nondeter- 
ministic process however, the heat dumped exceeds that 
reversible contribution by ksTspa. For this particular 
reason, for the constant temperature case, it is tempting 
to call sp a as the dimensionless excess heat (in units of 
ksT) associated with this particular transition. For the 
same reason, it can also be called as the dimensionless ex- 
cess work. Despite what the name may imply, the excess 
heats can be negative for nondeterministic transitions. 

The excess heats are convenient quantities to be con- 
centrated on, because in a cyclic change where a set of 
processes bring the system back to the initial configura- 
tion and state, all reversible contributions in (|10p add up 
to zero. The sum of excess heats then gives the total 
heat dumped to the bath. This is the case in the con- 
text of Maxwell's demon, for example. In this way, the 
inconvenience brought by the asymmetry of the states is 
eliminated. 



B. Proof of Theorem 1 for the Classical Case 

This subsection contains the proof of theorem 1 for 
a classical system. To make the derivation as clear as 
possible, it is assumed that the system interacts with 
a single bath B at temperature T. The proof of the 
general case, which involves various baths at different 
temperatures, is not different from the proof given below, 
but the notation is more involved. For this reason, the 
sketch of the general proof is given in the Appendix. 

The proof relies on the following approximations which 
are justified by the largeness of the bath. (1) At the ini- 
tial preparation stage, the total energy E of the com- 
posite system S + B has an arbitrary distribution in a 



4 



certain range of energies, say between E m i n and E max . 
For any E in this interval, the associated microcanon- 
ical temperature of composite system is approximately 
T. (2) The width of the range, E max — E m i n , is much 
larger than the typical energies for the system. (3) Given 
that the initial energy of S + B at time t\ is E, the 
process parameters P(f3\a), W(f3 <— a) and Q([3 <— a) 
of the process is approximately independent of E for 
E m in < E < E max . The errors made from these ap- 
proximations will get smaller as the size of the bath in- 
creases. The last assumption enables us to investigate 
the relationship between these quantities within the mi- 
crocanonical ensemble formalism. In other words, it is 
supposed that the initial states of the composite system 
has total energy E within the aforementioned range, and 
the microstates are distributed with equal a priori prob- 
abilities. The basic method is to express the transition 
probabilities in terms of certain disjoint phase-space vol- 
umes. A particular addition of these volumes leads di- 
rectly to the inequalities of theorem 1. 

The process induces a canonical transformation <fi on 
the total phase space of S+B from t\ to t<z- Let 4> map the 
initial points (s, b) to the final points (s',b') = <fi(s,b). 
Since <p is canonical, it preserves volumes by Liouville's 
theorem, i.e., the phase-space volume elements are equal, 
dsdb = ds'dW . 

Let n,B(E)dE represent the phase-space volume of the 
bath consisting of points having energy between E and 
E + dE. The corresponding density is 



n B {E) = J 5(E-H B {b))db 



(11) 



As a result of the assumptions made above, the relation 

n B (E-AE) = n B (E)e- AE / kBT (12) 

holds with good accuracy for typical energies AE of the 
system. 

Let N ca (E)dE represent the volume of the total phase 
space of S+B of those microstates having energy between 
E and E + dE and belonging to state-a of system S in 
configuration c. The corresponding density is 

N ca (E) = f dsdb5(E - H cS (s) - H B (b))6 ca (s) 

= J dsn B {E - H cS (s))9 ca (s) 

= n B (E) J d S exp(-H cS (s)/k B T)8 ca (s) 
= n B (E)Z ca . (13) 

In here, the interaction terms representing bath-system 
coupling are dropped assuming that they are negligible. 

Finally, let Mp a {E' , E)dE'dE be the volume of phase- 
space points of S + B that start at state-a with energy 
between E and E + dE and end up at state-/? with energy 
between E' and E' + dE'. The corresponding density can 



be expressed as 

M Pa (E',E) = J dsdbS(E - H iS (s) - H B (b))e ia (s) 

xS(E' - H fs (s') - H B (b'))9 f0 (s') . (14) 

Note that Mp a (E',E) represents the density of points 
where an a to (3 transition has occurred and the total 
energy has increased by E' — E. In that case, E' — E is 
the total work done. 

Now, suppose that the total energy of S+B was E and 
the system was in state a before the process is applied. 
Then, by equal a priori probabilities assumption, S + B 
is in one of the microstates consistent with these restric- 
tions with equal probability. In that case, the probability 
that a transition to state [3 occurs with final energy be- 
tween E' and E' + dE' can be expressed as 



Mf3 a (E', E)dE' 
N ia (E) 



(15) 



Let P((3\a; E) be the probability of making an a to 
(3 transition, irrespective of the work done, and let 
'Pf3a(w, E) be the probability distribution function for 
the work done w in the a to j3 transition. These proba- 
bilities can be expressed as 



P(P\a;E) = J 
Vp a {w\E) 



M f3a {E',E)dE l 
N la (E) 
M 0a (E + w,E) 
P(/3\a;E)N la (E) 



(16) 
(17) 



As discussed above, both of these probabilities have only 
a weak dependence on E, which can be considered to be 
a dependence on the temperature T. For this reason, we 
will write P(/3\a; E) = P{/3\a) and Pp a (w; E) = Vp a {w); 
the dependence on the temperature is assumed, but not 
shown explicitly. Combining these with Q13[) we get 



Mp a (E',E) =n B (E)Z la P(/3\a)Vpa(E' - E) . (18) 

This equation relates the phase-space volume densities to 
the process dependent quantities: the transition proba- 
bilities and the distribution function for the work done 
in individual transitions. 

To obtain the inequalities of the theorem, we sum (Ti"4")) 
over the initial logical states and integrate over E which 
gives 

Ea/ dEMp a [E' ,E) — f' ds'db' 

x5(E'-H fS (s')-H B (b>))9 ff3 (s') , (19) 

where the equality of the volume elements, dsdb = 
ds'db', is used and the prime on the integral sign in- 
dicates that the integration is over all possible final mi- 
crostates. This integral is not over the whole of the phase 
space when the process map <j> is not onto. This may hap- 
pen when infinite, impenetrable barriers evacuate some 
part of the phase space. For example, this is the case 
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for the erasure process considered by Landauer where all 
final logical states other than the standard state are in- 
accessible. For this reason, the right hand side of (|19p is 
smaller than the integral over the whole of phase space, 

Y, [ dEM 0a (E', E) < N f/3 (E') = n B (E')Z f/3 . 

a 

(20) 

This is the first place where an inequality is introduced. 
The equality holds if and only if <j> is onto. 

Using fig]) and changing the integration variable to 
w = E' — E gives 

a J 

Y,P(f3\a){e- w/kBT )f3 a e- {F "*- F ^ )/kBT < 1,(21) 

a 

where (• • - )ff a represents the conditional average over mi- 
crostates that take part in a to j3 transition. Using 
the strict convexity of the exponential function we get 
exp(— (w) p a /kBT) < (e^q>(—w / ksT)) p a where equality 
holds if and only if work done has no fluctuations. Fi- 
nally, using (w)p a = W(fl <— a) and the relation ©, we 
get the desired inequality in Eq. ([!]).□ 

Inequalities are introduced at two points; therefore if 
a process has equalities for all of these m relations, the 
process map <f> on the total phase space should be onto 
and works done should have absolutely no fluctuations. 
We will define Ip , the inefficiency of the process for the 
final state /?, as 

e- 1 ? = J2P{p\a)e-^" . (22) 

a 

The relations ([I]) then imply that all inefficiencies are 
non-negative. Inefficiencies are essentially a combined 
measure of the irreversibility of the process and the fluc- 
tuations in the energy exchanges. As it will be seen be- 
low, by process engineering, some excess heats can be 
reduced and all inefficiencies can be made to vanish. In 
the case of an efficient process, where all Ip = 0, it is 
not possible to decrease any of the excess heats with- 
out increasing some other excess heat corresponding to 
a transition with the same final state. By the discus- 
sion above, efficient processes have no fluctuation for the 
excess heats and the process map is onto. 

It might be interesting to view the derivation above 
from the perspective of the time-reversed process. The 
time reversal of a process is always well defined in the 
Hamiltonian description; it is simply given by the Hamil- 
tonian 7i(t) = H(—t) and brings the system from the fi- 
nal configuration to the initial one. Basically, one needs 
to carry out the same actions in reverse order. Below, the 
associated quantities for the reverse process will be indi- 
cated by tildes. For simplicity, consider the case where 4> 
is onto. As the reversed process map is <j) — <t>~ j the den- 
sities in dHJ) are related by M a p(E,E') = Mp a (E',E). 



Invoking (fTSf . the following relation between the proba- 
bility distributions can be found 

P(a\/3)-p af3 (-w) = P(p\a)V 0a (w)e- w I^^SL . (23) 

z fP 

That relation can be made simpler by expressing it in 
terms of a microstate dependent variable for the excess 
heat, s = (w + Fi a — Ffp)/kBT. Note that {s)p a = 
sp a and s has no fluctuations if and only if w has no 
fluctuations. This variable can also be expressed as a 
function of the microstate coordinates as 

s = ^(E^Ms)-E F /^/7(s') 

7 7 

+H fs (s') + H B (b') - H iS (s) - £r fl (b)) , (24) 

for the forward process. Therefore, for the reversed pro- 
cess with initial point at (s',b'), the value of the cor- 
responding variable is — s. The relationship between the 
probability distributions of s for forward and reverse pro- 
cesses then becomes 

P{a\P)V' aP {-s) = P{P\a)V' 0a {s)e- s , (25) 

where V denotes the distribution function for that quan- 
tity. The inequalities ([T]) are obtained by using the fact 
that the total probability for the reversed process is 1. 



C. Sketch of the Proof for Quantum Systems 

For quantum systems, there is a problem involved in 
the definition ([8]) of spa. To provide a consistent defi- 
nition of these quantities, it should be assumed that the 
system starts from a definite initial logical state a and 
when the process is completed a projective measurement 
of the final logical state is carried out. Provided that this 
is done, the average heats dumped to the baths can be 
computed from the expectation value of the respective 
Hamiltonians of the baths and therefore sp a are well- 
defined quantities. It is important that such a final mea- 
surement stage takes place to eliminate the possibility of 
having final microstates in a superposition state of vari- 
ous logical states. If this is the case, the inequalities of 
theorem 1 are valid. 

The proof for this case is not different from the clas- 
sical proof given above. There is only a change in the 
terms used. Instead of a canonical map, there is now an 
isometry V that maps the initial microstates into the fi- 
nal ones, which preserve the dimensions of the subspaces 
(yty = 1). As above, the map V does not need to be 
onto, i.e., it does not need to be unitary. In that case, 
VV' is a projection operator on the accessible final states 
and therefore VV^ < 1. 

The operators 9 ca are now a complete set of orthogo- 
nal projections (%2 a ca — 1) which commute with the 
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respective Hamiltonians H c s- All of the phase-space den- 
sities defined above can now be expressed as 

n B (E) = tr 8{E~H B ) , (26) 

N ca (E) = tr (6(E - H cS - H B )9 cce ) , (27) 

Mp a (E', E) = tr (VS(E - H lS - H B )6 la V^ 

xS(E'-H f s-H B )6 f0 ) . (28) 

The identity in Eq. (|18[) remains the same. The inequal- 
ity in Eq. (20|) follows by using the fact that VV^ < 1 
and the same convexity argument leads to the final proof. 

D. Completeness of the Relations in Theorem 1 

The set of inequalities (Q]) are also complete. In other 
words, one cannot find any more restrictions between the 
transition probabilities and entropy increases that cannot 
be derived from the given inequalities. This completeness 
statement is captured in the following theorem. 

Theorem 2. Let P((3\a) be some transition probabili- 
ties and sp a (0 = 1, . . . , m, a = 1, . . . , n) be some num- 
bers such that none of spa are infinite and the quantities 
1/3, which are defined as 

J2 P (P\ a ) e ~ S ' 3a = > ( 29 ) 

ot 

are nonnegative. Then, there is a system S having initial 
and final configurations with respectively n and m logical 
states, and there is a process having feature (A) on that 
system such that P(f3\a) are the transition probabilities 
and s/3 a are the total entropy increases for the respective 
individual transitions. 

Proof: The system that will be chosen for this pur- 
pose is essentially Szilard's one molecule gas»^i It is a 
classical system which is composed of a single molecule 
inside a box with a total volume V and in contact with 
a heat bath at temperature T. The box is divided by 
impenetrable walls into n regions having volumes Ri a V 
(a = 1, 2, . . . , n) for the initial configuration and into m 
regions having volumes Rf a V (a — 1,2, ... ,m) for the 
final configuration. Here R ca are positive numbers with 
^l a Rca = 1- The region that the molecule is located 
represents the logical state and all of them are separated 
by impenetrable barriers. 

Although it is not essential for the proof of the the- 
orem, it can be supposed that different constant poten- 
tials are applied to each region. If the molecule is an 
ion, this can be achieved by surrounding each region by 
a metallic sheet and applying different constant electro- 
static potentials to them. Let u ca be the potential energy 
in region-a for configuration c. This potential does not 
affect the motion of the molecule; its sole purpose is to 
adjust the internal energies of states to different values. 
The internal energy and entropies of the molecule can be 
expressed as 

U ca = u ca + f{T) , (30) 

Sea = 

\n(R ca V) + g(T) , (31) 
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FIG. 1: The steps of the process on a single molecule system 
with n — 2 initial and m = 3 final logical states. The system 
is prepared in initial configuration and (1) potentials of each 
region are lowered to zero, (2) additional walls are inserted, 
(3) volume of each region is expanded or compressed, (4) some 
parts of the walls are removed, (5) walls are aligned at con- 
stant volume, (6) each region is compressed adiabatically by 
pistons from left, (7) small holes opened on the pistons to let 
the molecule expand freely towards the left part, and finally 
(8) pistons are removed and the potentials of each region are 
increased. 

for some functions / and g of temperature. Therefore, by 
selecting u ca , R ca and the volume V appropriately, it is 
possible to set the values of internal energy and entropies 
to anything that we choose. As it will be seen below, 
these values do not have any effect on the end results; 
they are completely arbitrary. 

Consider the following process applied on this gas. 
Note that all of the individual steps of this process can 
be carried out without knowing where the molecule is. 
Therefore, it is a process that satisfies the condition (A). 
At each step, only the value of the work done is com- 
puted; the heat exchange with the bath can be computed 
from the first law at the final stage. 

Prepare the box in the initial configuration. The 
molecule can be in any region. 

(1) First, decrease the potential of each region from Ui a 
to zero. No heat exchange with the bath occurs in 
this step and if the molecule is in region-a, then 
the work done on it is W± — —Ui a . 

(2) Next, divide each region into smaller volumes by 
inserting secondary walls, so that region-a (which 
had volume Ri a V) is divided into m smaller regions 
with volumes Vp a — P(f3\a)Ri a V. There will be 
nm such smaller regions at the end, some of which 
might have zero volume. This step does not require 
an expenditure of work. If molecule was in region- 
a, then the probability that it will appear in the 
region-/3a with volume Vp a is P((3\a). 

(3) Next, slowly change the volume of each of these 
nm regions such that the volume of region-/3a is 
changed from Vp a to 

V^=R fp P{P\(x)e^- s ^V . (32) 
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The work done on the molecule if it is in region (3a 
is 



W 2 = k B T\A 

V f3a 



Ri, 



ksT In — Ip + sp a 



(33) 
(34) 



Note that no problem arises if a particular volume 
Vp a is zero, since the probability that the molecule 
is there is zero. 

(4) Now, for each (3, connect the regions with indices 
/31, (32, (3n into a single connected region with 
volume 



IP 



V 



(35) 



by removing some parts of the walls. This step also 
does not require an expenditure of work. 

(5) Next, move the walls slowly in such a way that each 
region has still the same total volume, but at the 
end, the walls become arranged just like the final 
configuration. This step is needed for aesthetical 
reasons only. As long as the volumes of each region 
remain the same, there is no work done. 

(6) Now, slowly compress each region-/? from volume 
Vp to Vp — e~ Il3 Vp by a movable piston. If the 
molecule is in region-/3, the work done on it is 



Wa = k R T\n 



= k B TI B 



(36) 



Note that when all expansion and compression steps 
are done infinitely slowly there will be no fluctuations in 
the excess heats. This implies that the process map <f> is 
not onto if some inefficiencies are non-zero. It is obvious 
that the step 6 is responsible for this effect as it caused 
the evacuation of some part of the total phase-space. 

There is a stronger form of this theorem concerned with 
the general case where the system interacts with different 
heat baths at different temperatures. It states that the 
inequalities fl} capture also all restrictions that are sat- 
isfied by individual heats dumped to the baths. In other 
words, if each Qj{(3 <— a) are chosen such that the quan- 
tity sp a defined in Eq. JS]) satisfies {!]), then it is possible 
to construct a process where the average heats dumped 
into each bath in each possible transition are given by 
the chosen quantities. The proof of this statement is not 
complicated; one only needs to change the step 3 of the 
process described above in an appropriate way. For this 
reason, the construction of the process and the proof of 
this stronger statement are left to the reader. 



III. DISCUSSION 

Since the inequalities in (fT]) express all restrictions that 
can be placed on the excess heats, they contain other 
powerful relations. In this section, it will be shown that a 
number of relations that have been obtained by different 
researchers in different contexts can be derived from these 
inequalities. The simplicity of these derivations is an 
indication of the power of these inequalities. A few other 
implications of these relations are also discussed at the 
end. 



(7) Now, open a hole on the pistons used in step 6 (or 
remove them suddenly) , and let the molecule freely 
expand from volume Vp back to volume Vp. The 
work done on the molecule is zero again. The steps 
6 and 7 are needed to increase the total entropy by 
the desired amount. Note that step 7 is possible 
only if all Ip are non-negative. 

(8) Increase the potential energy of each region from 
zero to Ufp. The work done if the molecule is in 
region-/? is W4 — Ufp. 

At this point, the system is brought to the final con- 
figuration. If the molecule is initially in region-a, then 
it will appear in region-/3 with probability P((3\a). The 
total work done when this is the case is 



W(j3<-a) = Y, W i 



(37) 



Ufp - u ia + k B T I In 



R>ia 



f0 



s 0a (38) 



= F f p - F ia + k B Tsp a . (39) 
Therefore, the excess heat of that transition is spa.O 



A. Penrose's Lower Bounds on Excess Heats 

As each term in (fT]) has to be less than 1 , the following 
lower bound for the excess heats in terms of the corre- 
sponding transition probabilities can be given 

sp a > In P((3\a) . (40) 

Hence, some excess heats can be negative if the corre- 
sponding transition probabilities are less than one. This 
inequality has been first obtained by Penrose in his treat- 
ment of nondeterministic processes.— Inequalities in ([T|) 
enable us to see how the lower bound above can be ac- 
complished. If, a is the only initial state that leads to 
the final state (3, then this lower bound can be achieved. 
If possible initial states are more than one, then it is not 
possible to achieve the lower bound, but it is possible to 
approach arbitrarily close to it, at the expense of increas- 
ing sp\ for all A ^ a. 

A bound on the fluctuations of the excess heats can 
also be obtained if they are Gaussian. In that case, the 
average of exp(— s) for an a to (3 transition is 



s )f3 a = exp 



-Sf3a 



(41) 
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where As^ a is the standard deviation of the fluctuations. 
The upper bound on the standard deviation can then be 
obtained from ([25| as 

A S/3a < ^2 (s Pa -\nP((3\a)) . (42) 

In other words, the fluctuations are necessarily sup- 
pressed if the lower bound in (14T))) is approached. How- 
ever, if the fluctuations are not Gaussian, it is not possi- 
ble to find such bounds on the standard deviation. 



B. Landauer's Erasure Principle 

Consider a Landauer erasure process where all initial 
states end up in the same final state, e.g., in state /3 = 1 
(i.e., a restore-to-1 process). In this case, P{f3\a) = Spx, 
all inefficiencies except I\ are infinite and for the final 
state 1 we have 

er h =^e- Sl « < 1 . (43) 

a 

This relation has been first obtained by Szilardii in con- 
nection with his membrane model. Although Szilard as- 
sociated changes in entropy with the measurement pro- 
cess, a correct interpretation would connect it to the era- 
sure as discussed in Ref. [l2|. 

In order to obtain a lower bound on average ex- 
cess heats, it is convenient to introduce the Legendre 
transform of the information-theoretic entropy function, 
tr(p) = — J2 a Pa^ n Pai 01 a probability distribution p = 
(pi,p 2 , ■ ■ ■ ,Pn)- Let x = (xi,X2, ■■ ■ , x n ) be an n-tuplet 
of real numbers. The Legendre transform of a is 




where the minimum is taken over all possible probabil- 
ity distributions. The definition above implies that the 
inequality 

^p a x a - cr(p) > J(x) (46) 

a 

is satisfied for all p and x. Moreover, the inequality be- 
comes an equality if and only if p a — exp( J(x) — x a ). 
Taking (|46[) gives J(x) = 1% > and 

^p a si a > o-(p) . (47) 

OL 

If the system is a memory device and information is coded 
such that the state a appears with probability p a , then 
<r(p) is the measure of information in nats stored by the 
device. If the same resetting process is applied on the 



device (or on an ensemble of such devices) irrespective of 
the state, then the left-hand side of (|4T|) is the (ensemble) 
average of the excess heats. Therefore, this inequality 
states LEP. Note that the inequality holds for any prob- 
ability distribution p. As a result, it can be considered 
as an infinite set of mathematical relations between si a . 
Interpreted in this way, (|4"T|) is equivalent to (|4"3"f . 

C. Generalized Landauer Principle 

The derivation above can be repeated for an arbitrary 
process as well, which leads to the generalized form of 
Landauer's principle. Consider a process with transition 
probabilities P((3\a) and excess heats sp a . If the system 
is prepared in the initial configuration with probability 
distribution of states being p a , then 

Pf, = ^P(J3\a)p a , (48) 

ot 

is the probability distribution of final states and 

s = J2 s 0c t P(0\a)p a , (49) 

a/3 

is the average excess heat. A lower bound on s can be 
obtained as follows. 

First, consider a single final state (3. If we take x a — 
[spa — lnP(P\a)] (for a = 1,2, ...,n), then it can be 
seen that J(x) is equal to the inefficiency I p. Using the 
inequality (|46[) . we can then write 

53[a /Ja -lnP09|a)]gJ,-ff(q')>J(x)=/ /3 , (50) 

a 

which is valid for all probability distributions q'. Us- 
ing the particular distribution q' a = P{(3\a)p a /pp in this 
inequality gives 

J2l^ a P(l3\a) Pa +P(l3\a)p 

a 

(51) 

Finally, summing over the final states (3 gives 

S + ff(p)-£r(p) >Y^PP T P ■ ( 52 ) 



As each inefficiency has to be non-negative, we get the 
desired result, 

S > a(p) - a(p) . (53) 

This is identical in content with the non-decreasing prop- 
erty of Penrose's statistical entropy^ It is termed as the 
generalized Landauer principle by Maroney2ii£ as it re- 
lates the average excess heat emitted to the environment 
to the change in the information-theoretic entropy of 
logical-state distributions. 

As the properties of the process (namely P(f3\a) and 
Sfj a ) are independent of the choice of the distribution p, 



9 



it is possible to view (|53|) as an infinite set of inequalities, 
one for each possible distribution p, placed on the pro- 
cess. Although they have clear physical interpretations, 
these inequalities are weaker than and not equivalent to 
the main inequalities of this article given in ([1]). The rea- 
son is that, all expressions in (153|) is essentially a com- 
bination of s a = J2p P{P\ot)sf3 a , the final-state average 
of the excess heat when the initial state is a. For this 
reason, (f53|) places restrictions only on the averages s a 
and not on the individual excess heats associated with 
every transition. 

Moreover, ([1]) and (|53p imply an opposite dependence 
between individual excess heats of transitions. As an 
example, consider a process where all inefficiencies are 
0, and one of the excess heats, say sp ai is desired to be 
decreased. Relations ([55)1 alone implies that, this can 
be achieved by increasing another excess heat with the 
same initial state (e.g., s^ a with fi ^ /3) in such a way 
that s a remains the same. However, according to (TT]), this 
is not possible; sp a can be decreased only by increasing 
another excess heat with the same final state (e.g., sp\ 
with A / a). For this reason, the relations (|S"3"|) are not 
sufficient for this kind of process engineering problems. 

D. Controlled Processes 

A trivial application of the inequalities in (fTJ) is to 
controlled processes. Here, the process to be applied 
on a system S is determined based on the logical state 
of another system C, the controller; but the controller 
does not change its logical state during the process. The 
controllcd-NOT gate is a well-known example. The re- 
strictions on the heat exchanges with the environment 
for such processes can be analyzed simply by imposing 
these restrictions for each individual process on S. To be 
precise, suppose that when the state of the controller is 
k, the process A^> is applied on S which has transition 
probabilities P^ k \j3\a) and excess heats sj^. For this 
case, the controlled process applied on the combined sys- 
tem S + C has the following transition probabilities and 
excess heats 

P(f3j\ak) = 8 jk P^ k \(3\a) , (54) 
S(3k,uk = s~pl . (55) 

The inequalities |T]) for the controlled process then are 
Y,P {k) (P\a)e- s ^ < 1 , (56) 

which must hold for all f3 and k. Therefore, the controlled 
process satisfies the inequalities ([lj if and only if every 
individual process A^ k ' applied on S satisfies the same 
inequalities. 

An immediate application of the result above is to the 
measurement processes. In this case C is the system 
whose state will be measured and S plays the role of 



the recording instrument. Before the measurement, S 
must be prepared in a standard state, say a = 1. The 
measurement is then a controlled process as above, where 
the individual process changes the state of S from 
a = 1 to /3 = k with certainty. As this is the only 
necessary requirement, it is possible to construct A^ 
as a deterministic and logically reversible process and 
choose Sj. fe ] = 0. This controlled process changes the 
logical states of S + C from (1, fe) into (k,k), i.e., the 
state of C has been copied into S, and no excess heat is 
transferred to the environment in doing this. This is a 
simple, but general demonstration of the principle first 
stated by Bennett^, i.e., thermodynamically reversible 
measurements can be done provided that the recording 
instrument is initialized in a standard state. 

E. Some further Bounds and Processes with 
Doubly Stochastic Transition Probabilities 

The inequalities in |T]) give the relation between excess 
heats of the same final state, e.g., it gives a restriction 
between spi, . . . , sp n . The following lower bound on the 
largest of these quantities can be easily deduced 

max Sl3a > In P(P\a)^J , (57) 

where the equality applies if and only if spi = . . . = Sfj n . 
From here, it is possible to show that the largest excess 
heat of all transitions, max a ^ sp a , is bounded from below 
by ln(n/m). In other words, at least one excess heat 
should exceed that bound. This result, together with a 
corresponding one for final-state averages of excess heats, 
is also contained in the following proposition. 
Proposition. The following are equivalent. 

(a) spa < ln(n/m) for all (3 and a, 

(b) s a < ln(n/m) for all a, where s a — P{j3\a)sp a 
denotes the final-state average of the excess heat for the 
initial state a, 

(c) Sfj a — \n(n/m) for all (3 and a, all inefficiencies are 
If} = and the transition probabilities satisfy 

J2P(f3\a) = - . (58) 

a 

Proof: As the implications (c)=>(a)=^(b) are trivial, 
we only need to show (b)=^(c). Suppose that (b) holds. 
First, consider a fixed a. Strict convexity of the expo- 
nential function leads to 

- < e- s « < P{(3\a)e- S ^ (59) 

where the rightmost inequality is an equality if and only 
s fia = s„ for all /3. Next, sum these inequalities over the 
initial state a and apply ([T|) to get 

to < J2 e ~ Sa - J2 p ^ a ^ e ~ Sfia - J2 1 = m ■ ( 60 ) 

a a/3 
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As the leftmost and rightmost sides of this chain are 
equal, all of the individual terms are equal to each 
other. Therefore, all inequalities that are used to ob- 
tain it must have been equalities as well. As a result, 
spa = s„ = ln(n/m) for all a and (3,1/3 = for all /3 and 
Eq. ((Ml) follows from these. □ 

At this point, it is worth to concentrate on a particular 
special case, the case where the initial and final config- 
urations of the system are identical (and hence n = m). 
Most applications, for example memory elements used in 
computation, fall under the scope of this case. A minor 
result that follows from the proposition for this case is 
that for any process there should be a transition with a 
non-negative excess heat. Similarly, there should be a 
state a for which s a > 0. 

The processes that satisfy the conditions of the propo- 
sition have some remarkable properties that should 
be mentioned. These are processes that have doubly 
stochastic transition probabilities 



following transition probabilities 



£p(/3|a)=^P(c 



= 1 



(61) 



In this case, it is possible to construct the process in 
such a way that all excess heats are zero which necessarily 
implies that the process is also efficient for all final states. 
The proposition is stating that any process that has non- 
positive excess heats for all transitions should be such a 
process. 

The excess heats of these processes have absolutely no 
fluctuations. There are no macroscopic fluctuations be- 
cause all excess heats are uniform, spa = 0, and there 
are no microscopic fluctuations because the process is 
efficient. If the logical states of the system are also sym- 
metric, i.e., they have equal equilibrium energy and en- 
tropy, then absolutely no heat exchange occurs with the 
bath and there is no work done irrespective of the ini- 
tial and final state. A contact with a bath is not even 
necessary to implement the process. 

Moreover, successive application of two such processes 
yields a process that has the same feature. For this rea- 
son, they might find application in the implementation 
of probabilistic Turing machines. If the individual com- 
putation steps of such a machine have doubly stochas- 
tic transition probabilities, then it is possible to run the 
whole computation without any heat exchange with the 
environment and without any work done (assuming that 
the states are symmetric). 

Furthermore, any probabilistic Turing machine can be 
adapted to have doubly stochastic transitions. Consider, 
for example, a step of the computation where a set of 
memory elements S with n logical states undergoes a log- 
ical operation with transition probabilities P((3\a), which 
may not be doubly stochastic. Let A be an ancillary de- 
vice having n logical states, which is initially in state 
i = l. Consider an operation on S + A which has the 



P'(0j\ai) = 



ri 2 (n-l) 



if i = 1, 
if i 5*1, 



(62) 



where rp = ^ Q P{[3\a) and the roman letters denote 
the logical states of A. It can be seen that P' is doubly 
stochastic and P'{(3j\oil) = P((3\a). In other words, 
if A starts in state 1, then same logical operation is ob- 
tained on S. In order to make the whole computation 
doubly stochastic, different ancillaries with the same ini- 
tial state have to be used at each computation step. This 
discussion essentially shows that probabilistic Turing ma- 
chines with doubly stochastic transition probabilities can 
have the same computing power as any other probabilis- 
tic machine, at the expense of using a larger memory 
space. 

However, it should be kept in mind that probabilistic 
Turing machines can also be designed to have determin- 
istic computation steps where randomness is introduced 
by an additional input tape containing random symbols. 
They are more manageable for thermodynamically re- 
versible computation since they can be adapted to have 
logically reversible computation steps as in Ref. [l3|. In 
that case, they will also be able to erase all interme- 
diate results of the computation, except the input, the 
desired output and the random tape, without any heat 
exchange. In addition, the symbols on the random tape 
can also be generated and subsequently erased without 
any heat exchange (e.g., inserting walls into a container 
containing single molecule for generation; removing the 
walls for erasing). As a result, there is no problem in 
doing probabilistic computation in a thermodynamically 
reversible way. 



IV. CONCLUSIONS 

A complete set of inequalities has been obtained that 
places restrictions on the excess heats and the transi- 
tion probabilities of any nondeterministic processes hav- 
ing feature (A). These inequalities can be interpreted as 
a generalized form of both Penrose's bound P0")) for non- 
deterministic processes and Landauer's bound (|43)) for 
resetting operations. Their unique power comes from 
the completeness property. As a result, just like these 
two classical results, any relation between excess heats 
and transition probabilities can be obtained starting from 
these inequalities. 



APPENDIX A: PROOF OF THEOREM 1 FOR 
VARYING TEMPERATURE CASE 

Consider the general case where the process brings the 
system into contact with various heat baths at different 
temperatures. The notation is described in subsection 
III Al and the proof follows the same lines of the one given 
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in siibsection lll Bl The quantum case is assumed because 
of its slightly better notation. It is also assumed that 
the baths Bi and Bf are distinct; the case where they 
are identical is not considered separately as the proof 
is altered only slightly. Suppose that there are N baths 
and the initial and final baths are Bi = B\ and Bf = Bn 
respectively. 

Let n Bj (E) denote the density of states of bath Bj at 
energy E which is defined by Eq. (j26|) and N ca (E) denote 
the density of states of S and B c for logical state a at 
configuration c = i, f. These are defined as in Eq. (|27|) for 
the appropriate bath and can be expressed as N ca (E) = 
n Be (E)Z ca (T c ). 

Let Ei, ... , En be a set of energies chosen in the ap- 
propriate temperature range of the corresponding baths 
and E[, . . . , E' N be another set of energies. To sim- 
plify the notation, the shorthand £ = [E\, . . . , En) and 
£' = (E[, . . . , E' N ) will be used. Consider the following 
set of microstates of the system and the baths. 

i. Initially, before the process is applied 

(a) S is in logical state a, 

(b) S + Bi has energy in interval {E\,E\ + dE\), 

(c) bath Bj (j 7^ 1) has energy in interval 
(Ej , Ej + dEj ) ; 

ii. and after the process is completed 

(a) S is in logical state (3, 

(b) S+Bf has energy in interval (E' N , E' N + dE' N ), 

(c) bath Bj (j ^ N) has energy in interval 



■dE'j) 



The corresponding "density of states" for such mi- 
crostates is given by 



tr (V 



Mp a (£'; £) = 
8 ia 8(E 1 — Ha 



9 ff3 S(E' N - H 



fs 



-H B 
H Bf ) 



)Uj^(Ej 



-H B 



(Al) 



where V is the isometry corresponding to the time devel- 
opment of the state from t\ to t%. It satisfies V^V = 1 
and VV^ < 1. Note that, E\ and E' N include also the 
energy of the system S, while all the other energies rep- 
resent only the corresponding bath's energy. 

The density of states in (|A1|) can be easily related 
to the process dependent probabilities. First, note that 
by equal a priori probabilities, all microstates satisfy- 
ing condition (i) above can be represented by the density 
matrix 



P = gOiadiEi 



where 



H iS - H Bi )l[6(Ej - H Bj ) (A2) 



Z 



E 



N 

J d£'M 0a {£'; £) = Z la (T x ) J| n Bj (E 



(A3) 



As a result, p' = VpV^ is the density matrix when the 
process is completed. Projective measurements by {Ofp} 
will result in the transition probabilities 

P(f3\a) = tr p'Bff, = ~J d£'M [ja {£'- £) . (A4) 



Finally, if collapse to /3-state occurs in that measurement, 
the final density matrix is p" = 8fpp'0fp/P(P\a) from 
which the distribution of final energies of the baths and 
the system, i.e., the function Vp a {£' — £) can be com- 
puted. The quantity defined in (|A1|) contains this infor- 
mation. As such, it can be expressed as 

JV 

Mp a {£';£) = P(fi\a)V Pa (8' - £)Z ia (Ti) l[n Bj {Ej) , 

i=i 

(A5) 

where again it is assumed that these probabilities have 
a weak dependence on initial energies £ and it is sup- 
posed that this dependence can be taken as an implicit 
dependence on the temperatures. 

Now, sum and integration of M over initial state and 
energies gives 



N 

d£M Pa (£';£)<Z f p(T N )l[n Bj (E'j) , (A6) 

a J 3 = 1 

where the inequality is introduced taking by into account 
that VV^ < 1. Using (|A5[) . the last inequality can be 
written as 



]TP(/?H J d£Vp a (£'-£) 



Z la (T 1 ) TjiiB^Ej) 



n 



Z JP (T N )^n B] {E' 3 ) 



5>(/3|a) (e-*)^ < 1 , (A7) 



where now the variable s is given by 



c _ F f(3 (T N ) F fa (TQ | y E'j-Ej 
k B TN k B Ti ^— f k B Tj 



(A8) 



Since 



(E' x -Ei)fs a = Q l (f3^a)-U ia , (A9) 
(Eff — En) /3a = Qf(P*-a) + U f ii , (A10) 
(E'j - Ej) Pa - Qj(0 <- a) (j ? 1, N) ,(A11) 

it can be seen that (s)/3 a — sp a . The inequalities in 
dU then follow by using the convexity of the exponential 
function. The detailed relation in Eq. between the 
probability distributions for reversed and forward pro- 
cesses continue to hold in this case as well. 
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